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Multi-valued networks provide a simple yet expressive qualitative state based modelling approach 
for biological systems. In this paper we develop an abstraction theory for asynchronous multi-valued 
network models that allows the state space of a model to be reduced while preserving key properties 
of the model. The abstraction theory therefore provides a mechanism for coping with the state space 
explosion problem and supports the analysis and comparison of multi-valued networks. We take as 
our starting point the abstraction theory for synchronous multi-valued networks which is based on 
the finite set of traces that represent the behaviour of such a model. The problem with extending 
this approach to the asynchronous case is that we can now have an infinite set of traces associated 
with a model making a simple trace inclusion test infeasible. To address this we develop a deci- 
sion procedure for checking asynchronous abstractions based on using the finite state graph of an 
asynchronous multi-valued network to reason about its trace semantics. We illustrate the abstraction 
techniques developed by considering a detailed case study based on a multi-valued network model 
of the regulation of tryptophan biosynthesis in Escherichia coli. 



1 Introduction 

Multi-valued networks (MVNs) ll25l[34l[35l are an expressive qualitative modelling approach for biolog- 
ical systems (for example, see |[35l |7l |28j |3l). They extend the well-known Boolean network WJ\ [T8l 
approach by allowing the state of each regulatory entity to be within a range of discrete values instead 
of just true ox false. The state of each regulatory entity is influenced by other regulatory entities in the 
MVN and entities update their state using either a synchronous update strategy |[T8ll39l where all entities 
simultaneously update their state, or an asynchronous update strategy |[33l [151 [36l where entities update 
their state independently using a non-deterministic approach. 

While MVNs have shown their usefulness for modelling and understanding biological systems fur- 
ther work is still needed to strengthen the techniques and tools available for MVNs. One interesting 
area that needs developing is a theory for abstracting MVNs. Abstraction techniques allow a simpler 
model to be identified which can then be used to provide insight into the more complex original model. 
Such techniques are well-known in the formal verification community as a means of coping with the 
complexity of formal models (see for example ||9l|6l[l0l[T3l). The main motivation behind developing 
such a theory for MVNs can be summarised as follows: 

(1) The analysis of MVNs is limited by the well-known problem of state space explosion. Using ab- 
straction is one useful approach which allows analysis results from a simpler approximate model to infer 
results about the original model. 

(2) Often several MVNs are defined at different levels of abstraction when modelling a system. It is 
therefore clearly important to be able to formally relate these models using an appropriate theory. 

(3) An abstraction theory would provide a basis for the step-wise refinement of MVNs. 



Proceedings of 5th Worksiiop on Membrane Computing 
and Biologically Inspired Process Calculi (MeCBIC 2011) 
Pages 95-111, 2011. 



© L. lason Steggles 
All the rights to the paper remain with the authors. 



(4) Identifying an abstraction for a complex MVN provides a means of better visualising and understand- 
ing the behaviour an MVN, giving greater insight into the system being modelled. 

The abstraction theory we present for asynchronous MVNs is based on extending the synchronous 
abstraction theory presented in Q. We formulate a notion of what it means for an MVN to be correctly 
abstracted by a simpler MVN with the same network structure but smaller state space. The idea is to 
use an abstraction mapping to relate the reduced state space of an abstraction to the original MVN. An 
abstraction is then said to be con^ect if its set of traces is within the absti'acted ti^aces of the original MVN. 
This definition of abstraction represents an under-approximation ^ |24l since not all of the behaviour 
of the original MVN is guaranteed to have been captured within the abstraction. We show that this 
approach allows sound analysis inferences about positive reachability properties in the sense that any 
reachability result shown on an abstraction must hold on the original model. An important result of this 
is that it therefore follows that all attractors of an asynchronous abstraction con^espond to attractors in 
the original MVN. Note that an alternative approach commonly used in abstraction is to use an over- 
approximation in |24l [TOl in which false positives may occur. However, such an approach appears to be 
problematic for MVNs and we discuss this further in Section [3] 

The non-deterministic nature of asynchronous MVNs mean that we encounter additional complica- 
tions compared to the synchronous case; an asynchronous MVN can have an infinite set of traces which 
means that directly checking trace inclusion to check a proposed abstraction is infeasible. We overcome 
these difficulties by constructing a decision procedure for checking asynchronous abstractions that is 
based on the underlying finite state graph of an MVN. We introduce the idea of step terms which are 
used to denote possible ways to use sets of concrete states to represent abstract states. The decision 
procedure starts with the set of all possible step terms and then iteratively prunes the set until either a 
consistent abstract representation has been found or the set of remaining step terms is too small to make 
it feasible to continue. We provide a detailed proof that shows the decision procedure correctly identifies 
asynchronous abstractions and discuss the complexity of the decision procedure. 

We illustrate the abstraction theory we develop by considering a case study based on modelling the 
regulatory network that controls the biosynthesis of tryptophan by the bacteria E. coli 1291 l27l . Trypto- 
phan is essential for the development of E. coli and its resource intensive synthesis is carefully controlled 
to ensure its production only occurs when an external source is not available. We investigate identi- 
fying asynchronous abstractions for an existing MVN model of this regulatory mechanism which was 
developed in [i30i . 

The paper is organized as follows. In Section |2] we provide a brief overview of the MVN modelling 
framework and present a simple illustrative example. In Section [3] we formulate a notion of abstraction 
for asynchronous MVNs and consider the analysis properties that can be infen^ed from an abstraction. In 
SectionlUwe present a decision procedure for checking asynchronous abstractions and provide a detailed 
proof of correctness for this procedure. In Section [5] we illustrate the theory and techniques developed 
by a case study based on modelling the regulatory network that controls the biosynthesis of tryptophan 
by E. coli. Finally, in Section [6] we present some concluding remarks and discuss related work. 

2 Multi-valued Network Models 

In this section, we introduce multi-valued networks (MVNs) l25l [34l [351 . a qualitative modelling ap- 
proach which extends the well-known Boolean network ITTl [TSl approach by allowing the state of each 
regulatory entity to be within a range of discrete values. MVNs can therefore discriminate between 
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the strengths of different activated interactions, something which Boolean networks are unable to cap- 
ture. MVNs have been extensively studied in circuit design (for example, see |[25ll20l ) and successfully 
applied to modelling biological systems (for example, see |[35i r7ll28l[3l). 

An MVN consists of a set of logically linked entities G = {gi ,gk} which regulate each other in 
a positive or negative way. Each entity gi in an MVN has an associated set of discrete states Y{gi) = 
{0, for some m,- > 1, from which its cun^ent state is taken. Note that a Boolean network is 
therefore simply an MVN in which each entity gi has a Boolean set of states Y{gi) = {0, 1}. Each entity 
gi also has a neighbourhood A'^(^,) = {^,1 , . . . } which is the set of all entities that can directly affect 
its state. A given entity gi may or may not be a member of A'^(^,) and any entity in which N{gi) = {} is 
taken to be an input entity whose regulation is outside the current model. The behaviour of each entity gi 
based on these neighbourhood interactions is formally defined by a logical next-state function fg. which 
calculates the next-state of gi given the cuiTcnt states of the entities in its neighbourhood. 

We can define an MVN more formally as follows. 

Definition 1. An MVN MV is a four-tuple MV = {G, Y,N,F) where: 

i) G = {gi , . . . is a non-empty, finite set of entities; 

ii) Y = {Y{gi),. . . ,Y{gk)) is a tuple of state sets, where each Y{gi) = {0, . . . ,m;}, for some m, > 1, is the 
state space for entity g,; 

iii) = {N{gi),. . . ,N{gk)) is a tuple of neighbourhoods, such that N{gi) C G is the neighbourhood of 
gh and 

iv) F = {fg^ ,...,fgi^) is a tuple of next-state multi-valued functions, such that if N{gi) = {gi^ , . . . 
then the function fg. : ^(g,-, ) x • • • x Y{giJ — )• Y{gi) defines the next state of gi. □ 

Consider the following simple example PL2 of an MVN defined in Figured] which models the core 
regulatory mechanism for the lysis-lysogeny switch |[34l l23l in the bacteriophage A (this model is taken 
from ||32l). It consists of two entities CI and Cro, defined such that Y{CI) = {0, 1} and Y{Cro) = 
{0, 1,2}. The next-state functions for each entity are defined using the state transition tables presented in 
Figure [11(b) (where [gi] is used to denote the next state of entity gi). We can summarise the interactions 
as follows: entity Cro inhibits the expression of CI and at higher levels of expression, also inhibits itself; 
entity CI inhibits the expression of Cro while promoting its own expression. 

In the sequel, letMV = {G,Y,N,F) be an arbitrary MVN. In a slight abuse of notation we let gi G MV 
represent that gi G G is an entity in MV. 
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(b) State transition tables 



Figure 1: The MVN model PL2 of the core regulatory mechanism for the lysis-lysogeny switch in 
bacteriophage A (taken from ll32l ). 
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A global state of an MVN MV with k entities is represented by a tuple of states (i'l, . . . ,Sf-), where 
Si G Y{gi) represents the state of entity gi € MV. As a notational convenience we often use ...si: to 
represent a global state {s\,... ,Sk). When the current state of an MVN is clear from the context we let g,- 
denote both the name of an entity and its corresponding current state. The global state space of an MVN 
MV, denoted Smv^ is the set of all possible global states Smv = Y{gi) x ■■■ x Y{gic). 

The state of an MVN can be updated either synchronously (see |[T8ll39l ). where the state of all enti- 
ties is updated simultaneously in a single update step, or asynchronouslxl {see |[33l[T5l ). where entities 
update their state independently. We define these update strategies more formally as follows: 

Definition 2. 

1) Synchronous Update: Given two states Si,S2 € Smv^ we let Si — )• 52 represent a synchronous update 
step such that S2 is the state that results from simultaneously updating the state of each entity gi using its 
next-state function fg. and the appropriate states from as indicated by the neighbourhood N{gi). 

2) Asynchronous Update: For any gj G MV and any state S G Smv we let [S]^' denote the global state that 
results by updating the state of gi in S using fg.. Define the global state function next^^ : Smv ^{Smv) 
on any state S G Smv by 

next^^iS) = {[Sf I gi G MV and [Sf / S} 
Given a state Si G Smv and ^2 G next^^{Si), we let Si — > ^2 represent an asynchronous update step. □ 

Note that given the above definition, only asynchronous update steps that result in a change in the 
current state are considered (see |[T5l ). 

Continuing with our example, consider the global state 12 for PL2 (see Figure [T]) in which CI has 

state 1 and Cro has state 2. Then 12 01 is a single synchronous update step on this state resulting in 

the new state 11. Considering an asynchronous update, we have next^^ (12) = {02, 11} and 12 — > 02 

and 12 — > 11 are valid asynchronous update steps. 

The sequence of update steps from an initial global state through Smv is called a trace. In the case of 
the synchronous update semantics such traces are deterministic and infinite. Given that the global state 
space is finite, this implies that a synchronous trace must eventually enter a cycle, known formally as an 
attractor cycle II181I35I . 

Definition 3. A synchronous trace a is a list of global states o = {So,Si,S2,- ■ ■), where Si — > Si^i, for 
/ > 0. □ 

The set of all synchronous traces, denoted Tr^{MV), therefore completely characterizes the be- 
haviour of an MVN model under the synchi^onous semantics and is referred to as the synchronous trace 
semantics of MV. Note that we have one synchi^onous ti'ace for each possible initial state and so the set 
of synchronous traces is always finite (see |[T8l[39l ). 

In the asynchronous case, traces are non-deterministic and can be finite or infinite. A single initial 
state can have an infinite number of possible asynchronous traces starting from it and thus in the asyn- 
chronous case there can be infinite number of traces. 



'Note that different variations of the asynchronous semantics have been considered in the literature (see for example 1261 ) 
but that we focus on the one most commonly used for MVNs. 
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(a) Synchronous (b) Asynchronous 

Figure 2: The (a) synchronous and (b) asynchronous state graphs for PL2. 



Definition 4. An asynchronous trace o is either: 

i) a finite sequence of global states a = (5o,5i , . . . ,5„), where Si — > for / = 0, . . . ,n — 1, and 

next"^{Sn) = {}. 

Asy 

ii) an infinite sequence of global states a = (So,Si,S2, . . . ), where 5,- — ^ Si+i, for / > 0. □ 

The set of all asynchronous traces, denoted Tr^{M'V), therefore completely characterizes the be- 
haviour of an MVN model under the asynchronous semantics and is refeiTcd to as the asynchronous trace 
semantics of MV. Any state S G Smv which cannot be asynchronously updated, i.e. next^^ (S) = {}, is 
referred to as a point attractor ||34l . 

In our running example, PL2 has a state space of size {Sptzl = 6 and has the following (finite in this 
case) set of asynchronous traces: 

(00,01,02,01,02,...) (10) 

(00,10) (11,01,02,01,02,...) 

(01,02,01,02,...) (11,10) 

(02,01,02,01,...) (12,02,01,02,01,...) 

From the above traces it is clear that state 10 is a point attractor for PL2. 

The behaviour of an MVN under the synchronous or asynchronous trace semantics can be represented 
by a state graph (for example, see |[36l ) in which the nodes are the global states and the edges are 



precisely the update steps allowed. We let SG^{MV) = {Smv, ^) and SG^{MV) = {Smv, ^) denote 
the corresponding state graphs under the synchronous and asynchronous trace semantics. 
The synchronous and asynchronous state graphs for PL2 are presented in Figure |2] 
When analysing the behaviour of an MVN it is important to consider its attractors which can rep- 
resent important biological phenomena, such as different cellular types like proliferation, apoptosis and 
differentiation |[T6l . In the synchronous case all traces are infinite and so must lead to a cyclic sequence 
of states which are taken as an attractor ifTSl l35l l39l . As an example, consider PL2 (see Figure |2] (a)) 

which has the point attractor 10 ^ 10; and attractors 00 — > 11 — > 00 and 01 02 — > 01 of pe- 
riod 2. In the asynchronous case we have point attractors which are states that cannot be updated and 
also the strongly connected components in an MVN's asynchronous state graph are considered to be 
attractors |[36l . Again, considering PL2 (see Figure [21(b)) we can see that in the asynchronous case it 

has two point attractors, 01 and 10, and one attractor 01 — 4> 02 — ^ 01. 
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3 Asynchronous Abstractions 



In this section we consider developing a notion of absti^action for asynchronous MVNs. The idea is to 
formulate what it means for an MVN to be correctly abstracted by a simpler MVN with the same network 
structure but smaller state space. We take as our starting point the abstraction techniques developed for 
synchronous MVNs f5\ and investigate extending these to the asynchi^onous case. We show that our 
approach allows sound analysis inferences about positive reachability properties and that all attractors of 
an asynchronous abstraction correspond to attractors in the original MVN. 

We begin by recalling the notion of a state mapping and abstraction mapping lH used to reduce an 
entity's state space. 

Definition 5. Let MV be an MVN and let gj e MV be an entity such that Y{gi) = {0, . . . ,m} for some 
m > 1. Then a state mapping ^{gi) for entity gj is a surjective mapping 0(^,) : {0, . . . — > {0, . . . ,n}, 
where <n <m. □ 

The state mapping must be surjective to ensure that all states in the new reduced state space are used. 
From a biological viewpoint it may also be reasonable to further restrict the state mappings considered, 
for example, only considering those mappings which are order-preserving. Note we only consider state 
mappings with a codomain lai^ger than one, since a singular state entity does not appear to be of biological 
interest. 

As an example, consider entity Cro G PL2 (see Figure[T|) which has the state space Y [Cro] = {0,1,2}. 
It is only meaningful to simplify Cro to a Boolean entity and so one possible state mapping to achieve 
this would be: 

0(Cro) = {0 0, 1 1,2 1}, 

which maps state to and merges states 1 and 2 into a single state 1. 

In order to be able to simplify several entities at the same time during the abstraction process we 
introduce the notion of a family of state mappings. 

Definition 6. Let MV = {G,Y,N,F) be an MVN with entities G = {gi,... ,gk}. Then an abstrac- 
tion mapping <p = {(}){gi),...,(}){gk)) for MV is a family of mappings such that for each 1 < / < ^ we 
have 0(g() is either a state mapping for entity gi or is the identity mapping Ig. : Y{gi) Y{gi) where 
IgX^) = for all s G Y{gi). Furthermore, for (j) to be useful we normally insist that at least one of the 
mappings is a state mapping. □ 

Note in the sequel given a state mapping <p{gi) we let it denote both itself and the corresponding 
abstraction mapping containing only the single state mapping (j){gi). 

An abstraction mapping ^ can be used to abstract an asynchronous trace (see Definition |4l) using a 
similar approach to that detailed for synchi^onous traces |i5j|. We begin by defining how an abstraction 
mapping can be lifted to a global state. 

Definition 7. Let = {(}){gi) . . .<p{gk)) be an abstraction mapping for MV. Then ^ can be used to 
abstract a global state si ...s^ ^ Smv by applying it pointwise, i.e. ^(^i ...st) = (l){gi){si) . . .(I){gji){sk)- 

□ 

We can apply an abstraction mapping ^ to an asynchronous trace cr G Tr^{MV) by applying ^ to 
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each global state in the trace in the obvious way and and then merging consecutive identical states. Note 
that removing consecutive identical states is needed since by the definition of an asynchronous trace (see 
Definition ID) each asynchronous update rule must result in a new global state, i.e. the state of an entity 
has to change in order for a state transition to occur. 

Definition 8. Let (j) = (0(gi) • • - (pigk)) be an abstraction mapping for MV and let a € Tr^{MV) be 
either a finite a = {So,Si, . . . ,Sn) or infinite a = {So,Si,S2, ■ ■ ■) asynchronous trace. Then 0(cj) is the 
abstracted trace that results by 

i) First apply the abstraction mapping to each state in a, i.e. in the finite case {(j){So),(j){Si), . . . ,(j) (5,, ) ) 
or in the infinite case {(j) {Sq) , (j) (Si) , (j) {S2) , ■ ■■)■ 

ii) Next merge consecutive identical global states in the trace into a single global state to ensure that no 
two consecutive states are identical in the resulting abstracted trace, i.e. suppose the result is an infinite 
trace (^(5o), ^(^i ), ^(52), . . . ) then we know that for / € N we have ^(5,) 7^ □ 

We let (j){Tr^{MV)) = {0(a) | a G Tr^{MV)} denote the set of abstracted traces. 

As an example, consider applying the abstraction mapping <p{Cro) = {0i->0, li— )-l,2i->l}to 
the PL2 asynchi^onous trace (00,01,02,01,02,...). Part i) of Definition [8] above results in the trace 
(00,01,01,01,01, . . .); we now merge identical consecutive states to derive the abstracted trace (00,01). 
It is interesting to note that abstracting an infinite trace can result in a finite abstracted trace, as above. 
The intuition here is that a cyclic set of states have been abstracted to a single point. The complete set of 
abstracted asynchronous traces of PL2 using ^{Cro) are given below: 



(/)(Cro)((00,01, 02,01, 02,.. 


•)) 


= (00,01,) 


(^(Cro)((00,10)) 




= (00, 10) 


(^(Cro)((01, 02,01, 02,...)) 




= (01) 


0(Cro)((O2,Ol, 02,01,...)) 




= (01) 


0(Cro)((lO)) 




= (10) 


0(Cro)((ll,Ol,O2,Ol,O2,.. 


•)) 


= (11,01) 


(^(Cro)((ll,10)) 




= (11,10) 


0(Cro)((12,O2,Ol,O2,Ol,.. 


•)) 


= (11,01) 



The definition of an asynchronous abstraction is based on its trace semantics and follows along sim- 
ilar lines to that for the synchronous case (5"]. We say an asynchronous abstraction is correct if its set 
of traces is within the abstracted traces of the original MVN. This definition of abstraction represents an 
under-approximation [] since not all of the behaviour of the original MVN is guai^anteed to have been 
captured within the abstraction (we discuss the implications of this below). 

Definition 9. Let MVi = {GuYi^NuFi) and MV2 = {GiJi^Ni.Fi) be two MVNs with the same 
structure, i.e. G\ = G2 and Ni{gi) = N2{gi), for all gi € MVi. Let <p be an abstraction mapping from 
MV2 to MVi. Then we say that MVi asynchronously abstracts MV2 under (/), denoted MVi <I^MV2^ if^ 
and only if, Tr^ {MVi ) C ( Jr^ {MV2 ))■ □ 

As an absti^action example, consider the MVN APL2 defined in Figure [3] which has the same structure 
as PL2 (see Figure[Tl) but is a Boolean model. Then given the abstraction mapping (Cro) = {0 1-^ 0, 1 1-^ 
1,2 1} we can see that Tr^{APL2) C ^{Cro){Tr^{PL2)) holds and so APL2 is an abstraction of PL2, 
i.e. APL2 <|^*^*'™^ PL2 holds. Note that APL2 has two point attractors: 01 and 10 which con^espond 
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Figure 3: State transition tables defining APL2 and associated asynchronous trace semantics Tr^ {APL2). 



to the two attractors associated with PL2 (see Figure |2](b)) and thus, APL2 can bee seen to be a good 
approximation of the behaviour of PL2. 

Recall that one of the original motivations for developing an abstraction theory was to aid the anal- 
ysis of complex MVNs. It is therefore important to consider what properties of an asynchronous MVN 
can be inferred from an abstraction MVN. We consider reachability and the existence of attractors since 
these are the main properties that are considered when analysing an MVN. 

Theorem 10. Let MV\ <\mV2 and let 5'i,5'2 G Suvr If "^2 is reachable from Si in MVi then there 
must exist states S\,S'2 € Smv2 such that ^{S\) = Sy, <^{S'2) = S2, and ^2 is reachable from S\ mMV2- 

Proof. Since S2 is reachable from 5i there must exist a trace a G Tr^ {MV\ ) which begins with state 
and which contains state ^2. From Definition |9l we know that Tr^{MVi) C ^{Tr^{MV2)) must hold. 
Therefore there must exist a trace a' € Tr^{MV2) such that (j){a') = a. From this it is straightforward 
to see that there must exist the required states S[ and ^2 in a' such that ^{S\) = S\, ^{82) = S2, and ^2 is 
reachable from S[ . □ 



The above theorem indicates that inferring reachabiUty properties from an abstraction is sound but not 
complete |[T3l . The impUcations of this can be summarised as follows: (i) If one state is reachable from 
another in an abstraction then a corresponding reachability property must hold in the original model; (ii) 
However, if one state is not reachable from another in an abstraction then a corresponding reachability 
property in the original MVN may or may not hold and more analysis will be required. This relates to the 
fact that our notion of abstraction represents an under-appmximation 1911241 of the original model. The 
alternative approach would be to use an over-appmximation abstraction model |[9l|24l[T0l in which false 
positives can arise and need to be dealt with. It turns out that an over-approximation approach is not 
well suited to MVNs given that our goal is to find an abstraction model that is a well-defined MVN. To 
illustrate the potential problems, consider what happens if a point attractor is identified to a non-attractor 
state by an abstraction mapping. In this case no over-approximation abstraction can exist since such an 
MVN would need to contain a state that was both a point attractor and also had a successor state. Thus 
the approach taken here of using an under-approximation appeai^s to be the appropriate approach to use. 

Note that a consequence of the above is that all attractors in an abstraction must have corresponding 
attractors in the original MVN. 

Corollary 11. If MV\ <l^ MV2 then all attractors of MVi must represent attractors in MV2- 

Proof. Follows directly from the definition of an attractor and Theorem [TOl □ 
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4 A Decision Procedure for Asynchronous Abstractions 



Given we have now formulated a definition of an asynchronous abstraction we are now interested in 
defining a procedure for checking whether a proposed abstraction MVi is an asynchronous abstraction 
of an MVN MVz- In the synchi^onous case the approach taken was to simply check that each trace 
a G Tr^{MVi) was contained within the set of abstracted traces (}){Tr^ {MVz))- However, in the asyn- 
chronous case both sets of traces Tr^{MVi) and ^{Tr^{MV2)) may be infinite and so such a simple set 
inclusion check is not feasible. Instead we propose a decision procedure based on using the state graphs 
that summarise the behaviour of an asynchronous MVN. The idea is to consider all sets of states and 
associated edges that can be used to model an abstract state. We then iterate through these removing 
those state sets which can not be represented given the cun^ent allowable state sets. If at any point we no 
longer have any state sets remaining for a particular abstract state then we have shown the abstraction is 
not valid and we terminate the decision procedure. If, on the other hand, we reach a point at which no 
more state sets can be removed then we know the abstraction must be valid and we can again terminate 
the procedure. 

In the sequel let MVi and MV2 be MVNs with the same structure and let (j) be an abstraction mapping 
from MV2 to MV\ . 

In order to define a decision procedure checkAsynAbs (MVi ,MV2 , 0) for checking if MVi is an 
asynchronous abstraction under (j) of MV2 we begin by formulating some preliminary concepts. 

i) Representing abstract states: Let S e Y(MVi) then we define 

(j)-\S) = {S'\S'eY{MV2), (^{S') = S} 
to be the set of all states in MV2 that can represent the abstract state S. 

ii) Set of identical consecutive states: For any state S' G Y{MV2) we define the set [5']*^ of all con- 
secutive reachable states from S' that have the same abstract state 0(5"). Define [5"]'^ = U/eNi'^lf > where 
[S']f is defined recursively: [S']q = {S'} and 

[S'ti = {S'2 \ S[ e [S']f, S'2 G nexf^^n^'i), Hs') = HS'2)}- 

We now define the notion of a step term, an expression which is used to represent one possible way to 
model an abstract state using a set of original states. Such step terms will form the basis of our decision 
procedure. 

Definition 12. Let S G Y{MVi) and suppose next^^^{S) = {Si, . . . ,S,„}- Then for each non-empty 
set of states F C (5) we define the step term st{S,r) by 

st{S,r) = [S:r:DiSi),...,DiS,„)], 

where D(5,-) = {S'2 \ S[ G F, 5^ G next"^^{{S[} U K]'^), 0(5^) = Si}, and next'^^^ has been lifted from 
taking a single state as input to taking a set of states in the obvious way. Note that the use of is 
needed in the above definition to take account of the merging of consecutive identical states that occurs 
in abstracted traces (see part ii) in Definition [8]l. 

We say a step term [5 : F : D{S\),. . . ,D{Sm)] is valid iff: 
i) the states F used in a step term have the appropriate connections, i.e. D{Si) 7^ {}, for / = 1, . . . ,m; and 
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ii) if 5 is a point attractor in SC^lMVi) then it must be modelled by point attractors in SG'^{MV2) 
(discounting steps to identical abstracted states), i.e. if next^^^{S) = {} then for each 5' G F we have 

next^^^{[Sf)-[Sf = {}. □ 



We let Step{S) denote the set of all valid step terms 

Step{S) = {st{S,T) I r C (5), st{S,T) is valid}. 

Observe that each valid step term st{S,T) G Step{S) must correctly model in MV2 the connections 
between 5 G Y{MVi) and its corresponding next states next'^^^ (S) in MVi. 

The proposed decision procedure is presented in FigurelH It works by creating a family C = (C(S) C 
Step{S) I S G Y{MVi)) of sets of all valid step terms. It then repeatedly looks at each set of step terms 
C{S), for each abstract state S G Y{MVi), removing those that have next states that are not currently in 
the remaining stored step terms of C. 

Algorithm checkAsynAbs (MVi ,MV2 , 0) : 

/** Initialise valid state terms **/ 

for each SeY{MVi) do C{S)=Step{S) 

/** Iteratively check sets of step terms **/ 

repeat 

done : =true 

for each SeY{MVi) do 

for each [5 : T : D(5i), . . . ,D(5™)] G C(5) do 
for /:= 1 to m do 

if st{Si,D{Si)) ^C{Si) then 

C{S) = C{S) - {[5 : r : D(5i), . . . ,D(5,„)]} 
done : =f alse 
if C{S) = {} then return false 
until (done = true) 
return true 



Figure 4: Decision procedure for checking asynchronous abstractions MVi <i^MV2- 

It is straightforwai^d to show that the decision procedure must always terminate. 

Theorem 13. The decision procedure checkAsynAbs (MVi ,MV2 , (/>) always terminates. 

Proof. This follows from that fact we can only ever begin with a finite family of finite sets of step 
terms, that no step terms can ever be added, and that we must remove at least on step term in order to 
continue to the next iteration. Therefore the algorithm either terminates when no step terms are removed 
or continues to iterate until we reach a point where one set C{S) of step terms is empty, again resulting 
in termination of the algorithm. □ 

The complexity of the decision procedure in the worst case, when MVi is not an asynchronous 
abstraction of MV2, can be derived as follows. Assume MV\ is a Boolean model which has n entities 
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and k is an upper bound on the number of states in MV2 that can be abstracted to a single state in MVi, 
i.e. k > |^^'(5')|, for all S € Y{MVi). Note that k can be calculated from the abstraction mapping used 
and is not dependent on n. The three nested for loops in the decision procedure have an upper bound of 
0(2" X 2*^ X n) where: 2" is the number of states in MVi ; 2* is an upper bound on the number of different 
sets of states that can be mapped to a given abstract state; and n represents the maximum number of states 
that can be connected to a given state. The outer repeat until loop will iterate round removing a single 
step term until one of the step term sets is empty. This gives a final upperbound of C?(2^("+*) x n^). In 
practice the decision procedure should perform much better than this. Note that for a given abstraction 
mapping, k can be seen as a fixed constant which does not increase as entities are added (providing the 
state of those entities is not abstracted). 

Let [S :r:D{Si),. . . ,D{Sm)] be a vahd step term, let ai G Fand a2 GD(5,), for some 1 < / < m. Then 

note that due to the way consecutive identical states are treated it may not directly hold that tti — > a2 

since a2 G next'^^^- {{ai } U [ai]'^). We let oT = ai ^ a/ ^ ... ^ «f, for a( G [ai]*^, for 1 < 7 < r 

represent the sequence of identical abstracted states needed such that Oi a2 does hold in MV2. 

The following lemma considers how step terms can be chained together and is is needed to prove the 
main correctness result below. 



Lemma 14. Let C = {C{S) C Step{S) \ S G Y{MV\)) be a family of sets of valid step terms such 
that: 

i) For each S G Y{MVi) we have C{S) / {}; 

ii) The family C is closed under step terms, i.e. for each S G Y{M'V\ ) and [S:Y : D(5i ), . . . ,D(5m)] G C{S) 
we have st{Si,D{Si)) G C(5;), for I <i <m. 

Then every pathH Y = Yi — > . . . Yp the abstraction state graph SG^{MVi) must have a corre- 
sponding path a = ai . . . — > a,-, r > p, in the original state graph SG {MV2) such that 0(o;) = 7. 

Proof. 

Let Y = Yi — ^ ■ ■ ■ Yp be a path in the state graph SG {MV\). Then by assumptions i) and ii) it is 
straightforward to see there must exist a (not necessarily unique) chain of step terms 

[Y : F,- : . . . ,D(7+i), . . .] G C{y), st{Yp,rp) G C(7p) 

for 1 < / < p, such that for j = 2,. . . ,p we have Fy = D{Yj)- 

We now prove that for any G F^ there must exist a,- G F,, for I < i < p, such that a = of — > 

■ ■ ■ — > OCp-i — ^ ap is a path in SG {MV2) with ^{(x) = 7. We prove this using induction on G N, 
;? > 2 as follows. 

1) Induction Base. Let p = 2 and suppose we have a path 71 72. Then we know there must ex- 
ist step terms [71 : Fi : . . . ,0(72), . . .] G C{yi) and st{Y2,D{Y2)) G C(72) (as explained above). Clearly by 

the c 
and 



the definition of step terms we know that for any 0I2 G D(]^) there must exist a\ G Fi such that ai a2 



(j){ai — ^ a2) = 7i — ^ 72- 



^We note that a path differs from a trace in that a trace represents a complete run of an MVN whereas a path is simply a 
walk through an MVN's state graph. 
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2) Induction Step. Let p = q + I, for some ^ G N, g > 2. Suppose we have a path 7 = 7i — '-^ ■ ■ ■ — ^ 
Yq 7^+1- Then we know there must exist step terms 

[71 : Ti : . . . ,D{Y2),- ■ ■] G C(7i), [7 : 0(7) : . . . ,D(7+i), . . .] e C(7), .?(7^+i,£»(7^+i)) € C(7,+,), 

for 2 < / < g (as explained above). Then by the induction hypothesis we know for each € D{Yq) there 
must exist at G F,-, for 1 < / < ^, such that oT — ^ • • • — -> a^-i — > oCq is a path in SG'^{MV2) with 

, , Asy Asy Asy . Asy Asy T-.iir--- r -i-n i r 

(j>{a\ — > ■ ■ ■ — Uq-i — ^ Uq) = 7i — 4- • • • — ^ 7y. By the derimtion ol step terms it loUows that lor 

any Uq+i G 0(7^+1) there must exist Uq G Fq such that cCq+i- Combining this with the induction 

hypothesis given above shows the existence of the required path in SG^{MV2)- D 

It now remains to show that the decision procedure checkAsynAbs (MVi ,MV2 , </>) correctly checks 
for asynchronous abstractions. 

Theorem 15. checkAsynAbs (MVi MVi returns true if, and only if, MVi <1^MV2- 
Proof. 

Part 1) ^ Suppose checkAsynAbs (MVi ,MV2 . 0) returns true. By inspecting the decision procedure 
we can see this means that a family {C{S) C Step{S) \ S G Y{MVi)} of non-empty sets of valid step terms 
must have been found which is closed under step terms. Consider any abstract trace a G Tr^{MVi); 
then by Lemma [14] and since any trace can be interpreted as a path in SG^{MVi) we have that there 
must exist a path a in SG'^{MV2) such that 0(o;) = a. It is straightforward to see that a must be a 
well-defined trace for MV2, i.e. a G Tr^{MV2), by the definition of valid step term. This shows that 
Tr^{MVi) C (l){Tr^{MV2)) and so by Definition|3we have MVi <^MV2- 

Part 2) <^ Suppose MVi <i'^MV2 then by Definition |9] we know 

r/(MVi) C (j){Tr^{MV2)) (1) 

Then we show that there must exist a family of sets of valid step terms which are closed under step term 
inclusion and thus that checkAsynAbs (MVi ,MV2 ,<p) must terminate returning true. 
Let X C Tr^{MV2) be the set of traces that abstractly correspond to Tr^{MVi): 

X = {a\o' e Tr^{MV2), 3a G Tr^{MVi).^{a') = 0} 

For each S G Y{MVi), let X(5) denote the set of all states that abstract to S which occur at the start of a 
trace in X : 

X(5) = {a'(l) I a'GX, ^{a'{l))=S} 

where cj'(l) represents the first state of trace a'. Let next^^^ (S) = {Si ,S,„}, then using Definition 
[T2]we can define the step term 

st{S,X{S)) = [S:X{S):D{Sy),...,DiS,„)] 

Clearly, st{S,X{S)) must be valid by (1) above. We can now recursively define a set of step terms closed 
under step term inclusion from st{S,X{S)) as follows. 
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Define H {X {S) ) = U,eN H {X {S) ) where H {X (5) ) ; is defined recursively ■.H{X{S))o = {st {S,X{S))} 

and 

H{X{S))i+i = {st{VjMVj)) I [V:r:DiVi),...,D{Vr)]eH{X{S))i, Vj e {Vi, ■ ■ ■ ,Vr}}. 

Clearly, the set H{X{S) ) is closed under step term inclusion by construction. Also note that it can only 
contain valid step terms; this follows from (1) above and the fact that if st{S,r) is a valid step term then 
any new step term 5'f(5',ru {S'}) formed by adding an additional state S' G ^ H*^) must also be valid. It 
therefore follows that for each S G Y{MVi) we know that each step term st{Si,r) G H{X{S)) must occur 
in the initial family C of sets of step terms used in the decision procedure, i.e. st{S,r) G C{S). Since none 
of these step terms can be removed from C by the closure property it follows that the decision procedure 
checkAsynAbs (MVi ,MV2 , (/>) must terminate returning true. □ 



5 Case Study: The Regulation of Tryptophan Biosynthesis 

In this section we present a detailed case study which illustrates the abstraction techniques developed in 
the previous sections. Our case study is based on identifying abstractions for a published MVN model 
of the regulatory system used to control the biosynthesis of tryptophan in E. coli BOil . Tryptophan is 
an amino acid which is essential for the development of E. coli. However, the synthesis of tryptophan 
is resource intensive and for this reason is carefully controlled to ensure it is only synthesised when no 
external source is available. The regulatory network that controls the biosynthesis of tryptophan by E. 
coli has been extensively studied (see for example |[29ll27l ). 
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Figure 5 : An MVN model MTRP of the regulatory mechanism for the biosynthesis of tryptophan in E. 
coli (from ll30ll ). The state transition table for TrpExt has been omitted as this is a simple input entity. 
Note that the state transition tables use a shorthand notation where an entity is allowed to be in any of 
the states listed for it in a particular row. 



Consider the MVN model MTRP for tryptophan biosynthesis presented in Figure [5] which is taken 
from OOl . It consists of four regulatory entities: TrpE - a Boolean input entity indicating the presence 
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of the activated enzyme required for synthesising tryptophan; TrpR - a Boolean entity indicating if 
the repressor gene for tryptophan production is active; TrpExt - a ternary entity indicating the level of 
tryptophan in the external medium; and Trp - a ternary entity indicating the level of tryptophan within 
the bacteria. Note the above entity order is used when displaying global states for MTRP. We can see 
from the model that the presence of tryptophan in the external medium TrpExt directly affects the level 
of tryptophan within the bacteria Trp and that the activated enzyme TrpExt is required to synthesise 
tryptophan. The presence of tryptophan within the bacteria deactivates the enzyme TrpE and at higher- 
levels also activates the repressor TrpR which then acts to inhibit the production of the enzyme TrpE. 

The state space for the MTRP consists of 36 global states and for this reason we do not reproduce its 
state graph here. Instead we simply note that the asynchronous state graph for MTRP comprises three 

disjoint graphs based on the following three attractors: 0000 — > 1000 — > 1001 — > 0001 — ^ 0000; 
001 1; and 0122. To identify abstractions for MTRP we begin by defining appropriate state mappings for 
the non-Boolean entities TrpExt and Trp as follows: 

(^{Trp) = {0^0,1^1,1^ 1}, ^{TrpExt) = {0^^0,1^^ 1,2 1}. 

These can then be combined into an abstraction mapping 

^ = {hrpE, hrpRA {TrpExt), (j) {Trp)). 

Following the approach presented in Q, we first apply this abstraction mapping to MTRP to pro- 
duce a set ^{MTRP) of candidate abstraction models. By analysing ^{MTRP) we are able to estab- 
lish that there are 8 possible candidate abstraction models (we have 4 choices for next-state of TrpR 
and 2 choices for Trp). After investigating these candidate models we were able to identify one valid 
asynchronous abstraction ATRP (which is presented in Figure O for MTRP under ^ using the decision 
procedure checkAsynAbs (ATOP, MTRP, 0). Note that since Tr^{ATRP) and ^{Tr^{MTRP)) are in 
fact finite trace sets in this case we were able to verify the result ATRP <l^ MTRP, by checking that 
Tr^{ATRP) C ^{Tr^{MTRP)) holds. 
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Figure 6: The asynchronous abstraction ATRP identified for MTRP under the state mappings <p{Trp) 
{0 0,1 1,2 1} and (j){TrpExt) = {0 0, 1 1,2 1}. 



The state graph for ATRP consists of two disjoint graphs and has two attractors: 0000 — > 1000 — 4> 

1001 — > 0001 — > 0000; and 0011. It therefore successfully captures two of the three attractors present 
in MTRP. 
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6 Conclusions 



In this paper we have developed an abstraction theory for asynchronous MVNs based on extending the 
ideas developed for synchronous MVNs lH and defined what it means for an MVN to be correctly 
abstracted by a simpler MVN with the same network structure but smaller state space. The abstraction 
approach used is based on an under-approximation approach ||9l |24l in which an abstraction captures a 
subset of the behaviour of the original MVN. We showed that this approach allows positive reachability 
properties of an MVN to be inferred from a corresponding asynchronous abstraction and that all attractors 
of an asynchronous abstraction correspond to attractors in the original MVN. An alternative approach 
would be to use an over-approximation approach ||9l|24l[l0l in which false positives can arise. However, 
the construction of an abstraction model which over-approximates an MVN's behaviour appears to be 
problematic if we wish to remain within the MVN framework (see Section [3]for a discussion of this). 

Directly checking asynchronous abstractions turned out to be problematic given that an asynchronous 
MVN may have an infinite set of traces which makes it infeasible to directly check trace inclusion. To 
address this we developed a decision procedure for checking asynchronous abstractions based on the 
finite state graph of an asynchronous MVN. The decision procedure used step terms to denote possible 
ways to use sets of concrete states to represent abstract states and worked by iteratively pruning the set 
of step terms until either a consistent abstract representation has been found or the set of remaining step 
terms is too small to make it feasible to continue. Importantly, we provided a detailed proof that showed 
the decision procedure worked correctly. Note that as it stands, the decision procedure is inefficient; 
work is on going to refine this procedure and to use it as a basis of a tool for abstraction checking. Such 
a tool will provide the support needed to carry out more complex case studies, for example supporting 
the work currently underway to investigate abstractions for the relatively complex MVN model of the 
carbon starvation response in E. coli presented in 0. 

We illustrated the abstraction theory and techniques developed by considering a detailed case study 
based on identifying a Boolean abstraction for an asynchi^onous MVN model of the regulatory system 
used to control the biosynthesis of tryptophan in E. coli. The abstraction found proved to faithfully 
represent the behaviour of the original MVN and in particular, captured two of the three attractors known 
to exist in the original MVN. The case study illustrates the potential for the abstraction theory presented 
and in particular, how it allows the balance between the level of abstraction used and the tractability of 
analysis to be explored. 

An alternative approach for abstracting MVNs is to reducie the number of regulatory entities in 
an MVN while ensuring the preservation of key properties (see 11211 l37l \T2\ ). This approach seems to 
be complimentary to the one developed here and we are currently investigating combining these ideas. 
Another possible abstraction approach would be to make use of results on modelling MVNs using Petri 
nets |[m [3ll4l[8l and to then apply Petri net absti'action techniques (see for example lOTlfTQllBSl ). Such 
an approach appears promising from an analysis point of view but problematic in that the resulting Petri 
net abstraction may not be interpretable as an MVN and so force the modeller to explicitly use a different 
modelling formalism. 

One interesting area for future work is to investigate automatically constructing abstractions for a 
given MVN and abstraction mapping. Some initial work on restricting the search space for such abstrac- 
tions can be found in f5l but more work is needed here. One idea is to consider developing refinement 
techniques similar to those of CEGAR (Counterexample Guided Abstraction Refinement) lITOl and other 
abstraction refinement techniques |[24l . Closely linked to this idea is the notion of a maximal abstraction, 
that is an abstraction which captures the largest possible behaviour of the original MVN with respect to 
all other possible abstractions for the given abstraction mapping. In future work we intend to investigate 
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developing such a notion and in particular, consider how to automate the construction of such maximal 
abstractions. 
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